avg. word length | sentence |
---|---|
12.5714 | International Civil Aviation Organization atawa ICAO ( |
12.4286 | Kapangurusan Bareng Kasenian jeung Budaya Turk ( |
10.2500 | Organisasi-organisasi kasenian tumuwuh subur. |
10.0000 | Warnana coklat kabodas-bodasan atawa kakonéng-konéngan. |
10.0000 | Biasana kagiatanana mangrupa pasanggiri-pasanggiri tingkat SMP/SMA/Satata Sa-Jawa Barat. |
10.0000 | Perdana Mentri Malaysia mangrupikeun kapala pamaréntah Malaysia. |
9.7500 | Ribonukléoprotéin bisa jadi panyalindungan. |
9.7500 | Kurusétra, patempatan Baratayuda lumangsung. |
9.6250 | Cendrawasih Beureum daharna buah-buahan sarta rupa-rupa sarangga. |
9.6250 | Turnix suscitator powelli: Kapuloan Sunda Leutik. |
9.5714 | Subspecies maroccana nyaéta heksaploid sedengkeun cerasiformis tetraploid. |
9.5000 | Kabupatén Lampung Wétan dibentuk numutkeun Undang-Undang No. |
9.4286 | Lolobana wilayah Kabupatén Bandung mangrupa pagunungan. |
9.2857 | Taman Nasional Glacier kawentar ku kaénndahanana. |
9.2857 | Universitas Indonésia dipingpin ku saurang Réktor. |
9.2857 | Angkutan umumna maké angkutan beus jurusan Solo-Karanganyar-Tawangmangu. |
9.2500 | N-asetilglukosamina, hiji turunan glukosa. |
9.1250 | ISBN 979-3631-91-0 Ngajual borondong biasana dibuleud-buleud atawa diemplé-emplé. |
9.1111 | Moldova ngalaksanakeun pamaréntahan républik parleménter nu démokratis jeung répréséntatif. |
9.1111 | Karéta api Fajar jeung Senja Utama Solo ( |
9.1053 | High-Definition Multimedia Interface (HDMI) nyaéta hiji antarbeungeut (interface) sakabéh audio/vidéo digital anu dirojong ku industri sarta henteu dikomprési. |
9.0000 | Masarakat nu nempatan Kampung Ciptagelar disebutna kasepuhan. |
9.0000 | Contona nyaéta lipoprotéin, fosfolipid, jeung fosfatidilkolin. |
9.0000 | Gelombang éléktromagnétik kawangun ku komponén listrik atawa éléktrik jeung komponén magnétik. |
9.0000 | Kabupatén Bogor disawang salaku puseur pamaréntahan karajaan-karajaan éta. |
As in several subsections before, we replace minimal word length by average word length. The table shows the sentences with maximal average word length. Because some languages allow very long words, such sentences may also contain short stopwords. Hence, we may find (at least some) well-formed sentences. Otherwise, we refer to 4.5.2.3.
select avg(char_length(word)) as a, s.sentence from sentences s, inv_w i, words w where s.s_id=i.s_id and i.w_id=w.w_id and length(sentence)>40 and i.w_id>100 group by s.s_id order by a desc limit 30;
4.5.2.1 Maximum word rank in sentence
4.5.2.2 Average word rank in sentence
4.5.2.3 Sentences consisting of many low frequency words I
4.5.2.4 Sentences consisting of many low frequency words II
4.5.2.5 Sentences consisting of short words only I
4.5.2.6 Sentences consisting of short words only II
4.5.2.7 Sentences consisting of long words only I